Correcting for ascertainment bias in the inference of population structure
نویسندگان
چکیده
BACKGROUND The ascertainment process of molecular markers amounts to disregard loci carrying alleles with low frequencies. This can result in strong biases in inferences under population genetics models if not properly taken into account by the inference algorithm. Attempting to model this censoring process in view of making inference of population structure (i.e.identifying clusters of individuals) brings up challenging numerical difficulties. METHOD These difficulties are related to the presence of intractable normalizing constants in Metropolis-Hastings acceptance ratios. This can be solved via an Markov chain Monte Carlo (MCMC) algorithm known as single variable exchange algorithm (SVEA). RESULT We show how this general solution can be implemented for a class of clustering models of broad interest in population genetics that includes the models underlying the computer programs STRUCTURE, GENELAND and GESTE. We also implement the method proposed for a simple example and show that it allows us to reduce the bias substantially. AVAILABILITY Further details and a computer program implementing the method are available from http://folk.uio.no/gillesg/AscB/.
منابع مشابه
A Details on the Multinomial-dirichlet Model with Correlated Allele Frequencies
Supplementary material for Bioinformatics Application Note: “Correcting for ascertainment bias in the inference of population structure.” Gilles Guillot 1∗ , Matthieu Foll 2,3 1 Centre for Ecological and Evolutionary Synthesis, Department of Biology, University of Oslo. P.O. Box 1066 Blindern 0316 Oslo Norway. 2 Computational and molecular population genetics lab, Institute of Ecology and Evolu...
متن کاملBayesian approach to inference of population structure
Methods of inferring the population structure, its applications in identifying disease models as well as foresighting the physical and mental situation of human beings have been finding ever-increasing importance. In this article, first, motivation and significance of studying the problem of population structure is explained. In the next section, the applications of inference of p...
متن کاملThe discovery of single-nucleotide polymorphisms--and inferences about human demographic history.
A method of historical inference that accounts for ascertainment bias is developed and applied to single-nucleotide polymorphism (SNP) data in humans. The data consist of 84 short fragments of the genome that were selected, from three recent SNP surveys, to contain at least two polymorphisms in their respective ascertainment samples and that were then fully resequenced in 47 globally distribute...
متن کاملAscertainment bias in spatially structured populations: a case study in the eastern fence lizard.
Despite increased interest in applying single nucleotide polymorphism (SNP) data to questions in natural systems, one unresolved issue is to what extent the ascertainment bias induced during the SNP discovery phase will impact available analysis methods. Although most studies addressing ascertainment bias have focused on human populations, it is not clear whether existing methods will work when...
متن کاملEstimating population divergence time and phylogeny from single-nucleotide polymorphisms data with outgroup ascertainment bias.
The inference of population divergence times and branching patterns is of fundamental importance in many population genetic analyses. Many methods have been developed for estimating population divergence times, and recently, there has been particular attention towards genome-wide single-nucleotide polymorphisms (SNP) data. However, most SNP data have been affected by an ascertainment bias cause...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Bioinformatics
دوره 25 4 شماره
صفحات -
تاریخ انتشار 2009